The University of Arizona
Map Home
Loading...
Adjust height of sidebar
KMap

Topic:corpus creation

corpus creation

Since 2021, aggregated from related topics

About

    Corpus creation is the process of compiling and organizing a large collection of texts or spoken language samples for the purpose of linguistic analysis and research. Corpora can be created from a variety of sources, such as written texts, transcribed conversations, social media posts, and more. Researchers typically collect and annotate these texts in order to study patterns of language use, investigate linguistic phenomena, train natural language processing models, and more. Corpora are an essential resource for linguists, computational linguists, lexicographers, and other language researchers.

Related Topics

People

View more people